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FIELD OF THE INVENTION 

The present invention relates to vaccine formulations 
comprising papilloma virus proteins, either as fusion proteins, truncated 
proteins, or truncated fusion proteins. The invention further embraces 
methods for producing capsomeres of the formulations, as well as 
prophylactic and therapeutic methods for their use. 

BACKGROUND 

Infections with certain high-risk strains of genital papilloma 
viruses in humans (HPV) - for example, HPV 16, 18, or 45 — are 
believed to be the main risk factor for the formation of malignant tumors of 
the anogenital tract. Of the possible malignancies, cervical carcinoma is by 
far the most frequent: according to an estimate by the World Health 
Organization (WHO), almost 500,000 new cases of the disease occur 
annually. Because of the frequency with which this pathology occurs, the 
connection between HPV infection and cervical carcinoma has been 
extensively examined, leading to numerous generalizations. 

For example, precursor lesions of cervical intraepithelial 
neoplasia (CIN) are known to be caused by papilloma virus infections 
[Crum, New Eng. J. Med. J7ft:880-883 (1984)]. DNA from the g enomes 
of certain HPV types, including for example, strains 16, 18, 33, 35, and 
45, have been detected in more than 95% of tumor biopsies from patients 
with this disorder, as well as in primary cell lines cultured from the 
tumors. Approximately 50 to 70% of the biopsied CIN tumor cells have 
been found to include DNA derived only from HPV 16. 

The protein products of the HPV 16 and HPV 18 early genes 
E6 and E7 have been detected in cervical carcinoma ceil lines as well as in 



human keratinocytes transformed in vitro [Wettstein, et al. , in Papilloma 
Viruses and Human Cancer, Pfister (Ed.).CRC Press: Boca Raton, FL 
1990 pp 155-179] and a significant percentage of patients with cervical 
carcinoma have anti-E6 or anti-E7 antibodies. The E6 and E7 proteins 
have been shown to participate in induction of cellular DNA synthesis in 
human cells, transformation of human keratinocytes and other cell types, 
and tumor formation in transgenic mice [Arbeit, et al. , J. Virol. , 65:4358- 
4364 (1994); Auewarakul, et al, Mol. Cell Biol. 74:8250-8258 (1994); 
Barbosa, et al., J. Virol. 65:292-298 (1991); Kaur, etal, J. Gen. Virol 
70:1261-1266 (1989); Schlegel, etal, EMBO J. , 7:3181-3187 (1988)]. 
The constitutive expression of the E6/E7 proteins appears to be necessary 
to maintain the transformed condition of HPV-positive tumors. 

Despite the capacity of some HPV strains to induce 
neoplastic phenotypes in vivo and in vitro, still other HPV types cause 
benign genital warts such as condylomata acuminata and are only rarely 
associated with malignant tumors (Tkenberg, In Gross, et al., (eds.) 
Genital Papillomavirus Infections. Springer Verlag: Berlin, pp., 87- 
1 12]. Low risk strains of this type include, for example, HPV 6 and 11. 

Most often, genital papilloma viruses are transmitted between 
humans during intercourse which in many instances leads to persistent 
infection in the anogenital mucous membrane. While this observation 
suggests that either the primary infection induces an inadequate immune 
response or that the virus has developed the ability to avoid immune 
surveillance, other observations suggest that the immune system is active 
during primary manifestation as well as during malignant progression of 
papilloma virus infections [Altmann et al. in Viruses and Cancer, 
Minson et al., (eds.) Cambridge University Press, (1994) pp. 71-80]. 

For example, the clinical manifestation of primary infection 
by rabbit and bovine papilloma virus can be prevented by vaccination with 
wart extracts or viral structural proteins [Altmann, et al. , supra; Campo, 
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Curr. Top. In Microbiol and Immunol. 786:255-266 (1994); Yindle and 
Frazer, Curr. Top. In Microbiol, and Immunol. I86;2ll-253 (1994)]. 
Rodents previously vaccinated with vaccinia recombinants encoding HPV 
16 early proteins E6 or E7, or with synthetic E6 or E7 peptides, are 
5 similarly protected from tumor formation after inoculation of HPV 16 

transformed autologous cells [Altman, et al, supra; Campo, et al., supra; 
Yindle and Frazer, et al. supra}. Regression of warts can be induced by 
the transfer of lymphocytes from regressor animals following infection by 
animal papilloma viruses. Finally, in immunosuppressed patients, such as, 
10 for example, recipients of organ transplants or individuals infected with 
HIV, the incidence of genital warts, CIN, and anogenital cancer is 
elevated. 

To date, no HPV vaccinations have been described which 
comprise human papilloma virus late LI protein in the form of capsomeres 

15 which are suitable both for prophylactic and therapeutic purposes. Since 
the LI protein is not present in malignant genital lesions, vaccination with 
LI protein does not have any therapeutic potential for these patients. 
Construction of chimeric proteins, comprising amino acid residues from LI 
protein and, for example E6 or E7 protein, which give rise to chimeric 

20 capsomeres, combines prophylactic and therapeutic functions of a vaccine. 
A method for high level production of chimeric capsomeres would 
therefore be particularly desirable, in view of the possible advantages 
offered by such a vaccine for prophylactic and therapeutic intervention. 

Thus there exists a need in the art to provide vaccine 

25 formulations which can prevent or treat HPV infection. Methods to 

produce vaccine formulations which overcome problems known in the art 
to be associated with recombinant HPV protein expression and purification 
would manifestly be useful to treat the population of individuals already 
infected with HPV as well as useful to immunize the population of 

30 individuals susceptible to HPV infection. 



SUMMARY OF THE INVENTION 

The present invention provides therapeutic and prophylactic 
vaccine formulations comprising chimeric human papilloma capsomeres. 
The invention also provides therapeutic methods for treating patients 
infected with an HPV as well as prophylactic methods for preventing HPV 
infection in a susceptible individual. Methods for production and 
purification of capsomeres and proteins of the invention are also 
contemplated. 

In one aspect of the invention, prophylactic vaccinations for 
prevention of HPV infection are considered which incorporate the structural 
proteins LI and L2 of the papilloma virus. Development of a vaccine of 
this type faces significant obstacles because papilloma viruses cannot be 
propagated to adequate titers in cell cultures or other experimental systems 
to provide the viral proteins in sufficient quantity for economical vaccine 
production. Moreover, recombinant methodologies to express the proteins 
are not always straightforward and often results in low protein yield. 
Recently, virus-like particles (VLPs), similar in make up to viral capsid 
structures, have been described which are formed in Sf-9 insect cells upon 
expression of the viral proteins LI and L2 (or LI on its own) using 
recombinant vaccinia or baculovirus. Purification of the VLPs can be 
achieved very simply by means of centrifugation in CsCl or sucrose 
gradients [Kimbauer. et al. . Proc. Natl. Acad. Sci. (USA), 99: 12180-12814 
(1992); Kirnbaurer, et al., J. Virol. 67:6929-6936 (1994); Proso, et al, J. 
Virol. 6774:1936-1944 (1992): Sasagawa, et al., Virology 2076:126-195 
(1995): Volpers, et al., J.Virol. 69:3258-3264 (1995); Zhou, et al, J. 
Gen. Virol. 74:762-769 (1993): Zhou, et al., Virology 755:251-257 
(1991)]. WO 93/02184 describes a method in which papilloma virus-like 
particles (VLPs) are used for diagnostic applications or as a vaccine against 
infections caused by the papilloma virus. WO 94/00152 describes 



recombinant production of LI protein which mimics the conformational 
neutralizing epitope on human and animal papilloma virions. 

In another aspect of the invention, therapeutic vaccinations 
are provided to relieve complications of, for example, cervical carcinoma 
or precursor lesions resulting from papilloma virus infection, and thus 
represent an alternative to prophylactic intervention. Vaccinations of this 
type may comprise early papilloma virus proteins, principally E6 or E7, 
which are expressed in the persistently infected cells. It is assumed that, 
following administration of a vaccination of this type, cytotoxic T-cells 
might be activated against persistently infected cells in genital lesions. The 
target population for therapeutic intervention is patients with HPV- 
associated pre-malignant or malignant genital lesions. PCT patent 
application WO 93/20844 discloses that the early protein E7 and antigenic 
fragments thereof of the papilloma virus from HPV or BPV is 
therapeutically effective in the regression, but not in the prevention, of 
papilloma virus tumors in mammals. While early HPV proteins have been 
produced by recombinant expression in E. coli or suitable eukaryotic cell 
types, purification of the recombinant proteins has proven difficult due to 
inherent low solubility and complex purification procedures which generally 
require a combination of steps, including ion exchange chromatography, gel 
filtration and affinity chromatography. 

According to the present invention, vaccine formulations 
comprising papilloma virus capsomeres are provided which comprise 
either: (i) a first protein that is an intact viral protein expressed as a fusion 
protein comprised in part of amino acid residues from a second protein; (ii) 
a truncated viral protein; (iii) a truncated viral protein expressed as a fusion 
protein comprised in part of amino acid residues from a second protein, or 
(iv) some combination of the three types of proteins. According to the 
invention, vaccine formulations are provided comprising capsomeres of 
bovine papilloma virus (BPV) and human papilloma virus. Preferred 
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bovine virus capsomeres comprise protein from bovine papilloma virus type 
I. Preferred human virus capsomeres comprise proteins from any one of 
human papilloma virus strains HPV6, HPV11, HPV16, HPV18, HPV33, 
HPV35, and HPV45. The most preferred vaccine formulations comprise 
5 capsomeres comprising proteins from HPV16. 

In one aspect, capsomere vaccine formulations of the 
invention comprise a first intact viral protein expressed as a fusion protein 
with additional amino acid residues from a second protein. Preferred intact 
viral proteins are the structural papilloma viral proteins LI and L2. 

10 Capsomeres comprised of intact viral protein fusions may be produced 

using the LI and L2 proteins together or the LI protein alone. Preferred 
capsomeres are made up entirely of LI fusion proteins, the amino acid 
sequence of which is set out in SEQ ID NO: 2 and encoded by the 
polynucleotide sequence of SEQ ID NO: 1 . Amino acids of the second 

15 protein can be derived from numerous sources (including amino acid 

residues from the first protein) as long as the addition of the second protein 
amino acid residues to the first protein permits formation of capsomeres. 
Preferably, addition of the second protein amino acid residues inhibits the 
ability of the intact viral protein to form virus-like particle structures; most 

20 preferably, the second protein amino acid residues promote capsomere 

formation. In one embodiment of the invention, the second protein may be 
any human tumor antigen, viral antigen, or bacterial antigen which is 
important in stimulating an immune response in neoplastic or infectious 
disease states. In a preferred embodiment, the second protein is also a 

25 papilloma virus protein. It also preferred that the second protein be the 
expression product of papilloma virus early gene. It is also preferred, 
however, that the second protein be selected from group of El, E2, E3, 
E4, E5, E6, and E7 - early gene products encoded in the genome of 
papilloma virus strains HVP6, HPV11, HPV18, HPV33, HPV35, or HPV 

30 45. It is most preferred that the second protein be encoded by the HPV 16 



E7 gene, the open reading frame of which is set out in SEQ ID NO: 3. 
Capsomeres assembled from fusion protein subunits are referred to herein 
as chimeric capsomeres. In one embodiment, the vaccine formulation of 
the invention is comprised of chimeric capsomeres wherein LI protein 
amino acid residues make up approximately 50 to 99% of the total fusion 
protein amino acid residues. In another embodiment, LI amino acid 
residues make up approximately 60 to 90% of the total fusion protein 
amino acid residues; in a particularly preferred embodiment, LI amino 
acids comprise approximately 80% of the fusion protein amino acid 
residues. 

In another aspect of the invention, capsomere vaccine 
formulations are provided that are comprised of truncated viral proteins 
having a deletion of one or more amino acid residues necessary for 
formation of a virus-like particle. It is preferred that the amino acid 
deletion not inhibit formation of capsomeres by the truncated protein, and it 
is most preferred that the deletion favor capsomere formation. Preferred 
vaccine formulations of this type include capsomeres comprised of 
truncated LI with or without L2 viral proteins. Particularly preferred 
capsomeres are comprised of taincated LI proteins. Truncated proteins 
contemplated by the invention include those having one or more amino acid 
residues deleted from the carboxy terminus of the protein, or one or more 
amino acid residues deleted from the amino terminus of the protein, or one 
or more amino acid residues deleted from an internal region (i.e., not from 
either terminus) of the protein. Preferred capsomere vaccine formulations 
are comprised of proteins truncated at the carboxy terminus. In 
formulations including LI protein derived from HPV16, it is preferred that 
from 1 to 34 carboxy terminal amino acid residues are deleted. Relatively 
shorter deletions are also contemplated which offer the advantage of minor 
modification of the antigenic properties of the LI proteins and the 
capsomeres formed thereof. It is most preferred, however, that 34 amino 



- 8 - 

acid residues be deleted from the LI sequence, corresponding to amino 
acids 472 to 505 in HPV16 set out in SEQ ID NO: 2, and encoded by the 
polynucleotide sequence corresponding to nucleotides 1414 to 1516 in the 
human HPV16 LI coding sequence set out in SEQ ID NO: 1. 
5 When a capsomere vaccine formulation is made up of 

proteins bearing an internal deletion, it is preferred that the deleted amino 
acid sequence comprise the nuclear localization region of the protein. In 
the LI protein of HPV 16, the nuclear localization signal is found from 
about amino acid residue 499 to about amino acid residue 505. Following 

10 expression of LI proteins wherein the NLS has been deleted, assembly of 
capsomere structures occurs in the cytoplasm of the host cell. 
Consequently, purification of the capsomeres is possible from the 
cytoplasm instead of from the nucleus where intact LI proteins assemble 
into capsomeres. Capsomeres which result from assembly of truncated 

15 proteins wherein additional amino acid sequences do not replace the deleted 
protein sequences are necessarily not chimeric in nature. 

In still another aspect of the invention, capsomere vaccine 
formulations are provided comprising truncated viral protein expressed as a 
fusion protein adjacent amino acid residues from a second protein. 

20 Preferred truncated viral proteins of the invention are the structural 

papilloma viral proteins LI and L2. Capsomeres comprised of truncated 
viral protein fusions may be produced using LI and L2 protein components 
together or LI protein alone. Preferred capsomeres are those comprised of 
LI protein amino acid residues. Truncated viral protein components of the 

25 fusion proteins include those having one or more amino acid residues 

deleted from the carboxy terminus of the protein, or one or more amino 
acid residues deleted from the amino terminus of the protein, or one or 
more amino acid residues deleted from an internal region (i.e., not from 
either terminus) of the protein. Preferred capsomere vaccine formulations 

30 are comprised of proteins truncated at the carboxy terminus. In those 



formulations including LI protein derived from HPV16, it is preferred that 
from 1 to 34 carboxy terminal amino acid residues are deleted. Relatively 
shorter deletions are also contemplated that offer the advantage of minor 
modification of the antigenic properties of the LI protein component of the 
fusion protein and the capsomeres formed thereof. It is most preferred, 
however, that 34 amino acid residues be deleted from the LI sequence, 
corresponding to amino acids 472 to 505 in HPV16 set out in SEQ ID NO: 
2, and encoded by the polynucleotide sequence corresponding to 
nucleotides 1414 to 1516 in the human HPV16 LI coding sequence set out 
in SEQ ID NO: 1. When the vaccine formulation is comprised of 
capsomeres made up of proteins bearing an internal deletion, it is preferred 
that the deleted amino acid sequence comprise the nuclear localization 
region, or sequence, of the protein. 

Amino acids of the second protein can be derived from 
numerous sources as long as the addition of the second protein amino acid 
residues to the first protein permits formation of capsomeres. Preferably, 
addition of the second protein amino acid residues promotes or favors 
capsomere formation. Amino acid residues of the second protein can be 
derived from numerous sources, including amino acid residues from the 
first protein. In a preferred embodiment, the second protein is also a 
papilloma virus protein. It also preferred that the second protein be the 
expression product of papilloma virus early gene. It is most preferred, 
however, that the second protein be selected from group of early gene 
products encoding by papilloma vims El. E2, E3, E4, E5, E6, and E7 
genes. In one embodiment, the vaccine formulation of the invention is 
comprised of chimeric capsomeres wherein LI protein amino acid residues 
make up approximately 50 to 99% of the total fusion protein amino acid 
residues. In another embodiment, LI amino acid residues make up 
approximately 60 to 90% of the total fusion protein amino acid residues; in 
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a particularly preferred embodiment, LI amino acids comprise 
approximately 80 % of the fusion protein amino acid residues. 

In a preferred embodiment of the invention, proteins of the 
vaccine formulations are produced by recombinant methodologies, but in 
5 formulations comprising intact viral protein, the proteins may be isolated 
from natural sources. Intact proteins isolated from natural sources may be 
modified in vitro to include additional amino acid residues to provide a 
fusion protein of the invention using covalent modification techniques well 
known and routinely practiced in the art. Similarly, in formulations 

10 comprising truncated viral proteins, the proteins may be isolated from 
natural sources as intact proteins and hydrolyzed in vitro using chemical 
hydrolysis or enzymatic digestion with any of a number of site-specific or 
general proteases, the truncated protein subsequently modified to include 
additional amino acid resides as described above to provide a truncated 

15 fusion protein of the invention. 

In producing capsomeres. recombinant molecular biology 
techniques can be utilized to produce DNA encoding either the desired 
intact protein, the truncated protein, or the truncated fusion protein. 
Recombinant methodologies required to produce a DNA encoding a desired 

20 protein are well known and routinely practiced in the art. Laboratory 

manuals, for example Sambrook. ei a!., (eds.), Molecular Cloning: A 
Laboratory Manual. Cold Spring Harbor Press: Cold Spring Harbor, 
NY (1989) and Ausebel et al. . (eds.). Protocols In Molecular 
Biology. John Wiley & Sons. Inc. (1994-1997), describe in detail 

25 techniques necessary to carry out the required DNA manipulations. For 
large-scale production of chimeric capsomeres, protein expression can be 
carried out using either viral or eukaryotic vectors. Preferable vectors 
include any of the well known prokaryotic expression vectors, recombinant 
baculoviruses, COS cell specific vectors, vaccinia recombinants, or yeast- 

30 specific expression constructs. When recombinant proteins are used to 



provide capsomeres of the invention, the proteins may first be isolated from 
the host cell of its expression and thereafter incubated under conditions 
which permit self-assembly to provide capsomeres. Alternatively, the 
proteins may be expressed under conditions wherein capsomeres are formed 
in the host cell. 

The invention also contemplates processes for producing 
capsomeres of the vaccine formulations. In one method, LI proteins are 
expressed from DNA encoding six additional histidines at the carboxy 
terminus of the LI protein coding sequence. LI proteins expressed with 
additional histidines (His LI proteins) are most preferably expressed in E. 
coli and the His LI proteins can be purified using nickel affinity 
chromatography. His LI proteins in cell lysate are suspended in a 
denaturation buffer, for example. 6 M guanidine hydrochloride or a buffer 
of equivalent denaturing capacity, and then subjected to nickel 
chromatography. Protein eluted from the nickel chromatography step is 
renatured, for example in 150 mM Nad. 1 mM CaCl 2 , 0.01% Triton-X 
100, 10 mM HEPES (N-2-hydroxyethyl piperazine-N'-2 ethane sulfonic 
acid), pH 7.4. According to a preferred method of the invention, assembly 
of capsomeres takes place after dialysis of the purified proteins, preferably 
after dialysis against 150 mM NaCl. 25 mM Ca 2+ , 10% DMSO (dimethyl 
sulfoxide). 0.1% Triton-X 100. 10 mM Tris [tris-(hydroxymethyl) amino- 
methane] acetic acid with a pH value of 5.0. 

Formation of capsomeres can be monitored by electron 
microscopy, and, in instances wherein capsomeres are comprised of fusion 
proteins, the presence of various protein components in the assembled 
capsomere can be confirmed by Western blot analysis using specific 
antisera. 

According to the present invention, methods are provided for 
therapeutic treatment of individuals infected with HPV comprising the step 
of administering to a patient in need thereof an amount of a vaccine 
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formulation of the invention effective to reduce the level of HPV infection. 
The invention also provide methods for prophylactic treatment of 
individuals susceptible to HPV infection comprising the step of 
administering to an individual susceptible to HPV infection an amount of a 
vaccine formulation of the invention effective to prevent HPV infection. 
While infected individuals can be easily identified using standard diagnostic 
techniques, susceptible individuals may be identified, for example, as those 
engaged in sexual relations with an infected individual. However, due to 
the high frequency of HPV infection, all sexually active persons are 
susceptible to papilloma virus infection. 

Administration of a vaccine formulation can include one or 
more additional components such as pharmaceutically acceptable carriers, 
diluents, adjuvants, and/or buffers. Vaccines may be administered at a 
single time or at multiple times. Vaccine formulation of the invention may 
be delivered by various routes including, for example, oral, intravenous, 
intramuscular, nasal, rectal, transdermal, vaginal, subcutaneous, and 
intraperitoneal administration. 

Vaccine formulations of the invention offer numerous 
advantages when compared to conventional vaccine preparations. As part 
of a therapeutic vaccination, capsomeres can promote elimination of 
persistently infected cells in. for example, patients with CIN or cervical 
carcinoma. Additionally, therapeutic vaccinations of this type can also 
serve a prophylactic purpose in protecting patients with CIN lesions from 
re-infection. As an additional advantage, capsomeres can escape 
neutralization by pre-existing anticapsid antibodies and thereby posses 
longer circulating half-life as compared to chimeric virus-like particles. 

Vaccine formulations comprising chimeric capsomeres can 
provide the additional advantage of increased antigenicity of both protein 
components of the fusion protein from which the capsomere is formed. 
For example, in a VLP, protein components of the underlying capsomere 
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may be buried in the overall structure as a result of internalized positioning 
within the VLP itself. Similarly, epitopes of the protein components may 
be sterically obstructed as a result of capsomere-to-capsomere contact, and 
therefore unaccessible for eliciting an immune response. Preliminary 
results using L1/E7 fusion proteins to produce VLPs support this position 
in that no antibody response was detected against the E7 component. This 
observation is consistent with previous results which indicate that the 
carboxy terminal region of LI forms inter-pentameric arm structures that 
allow assembly of capsomeres into capsids [Garcia, et al, J. Virol. 71: 
2988-2995 (1997)]. Presumably in a chimeric capsomere structure, both 
protein components of the fusion protein substructure are accessible to 
evoke an immune response. Capsomere vaccines would therefore offer the 
additional advantage of increased antigenicity against any protein 
component, including, for example, neutralizing epitopes from other virus 
proteins, expressed as a fusion with LI amino acid sequences. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention is illustrated by the following 
examples. Example 1 describes construction of expression vectors to 
produce fusion, or chimeric, viral proteins. Example 2 relates to 
generation of recombinant baculovi ruses for expression of viral proteins. 
Example 3 addresses purification of capsomeres. Example 4 describes an 
immunization protocol for production of antisera and monoclonal 
antibodies. Example 5 provides a peptide ELISA to quantitate capsomere 
formation. Example 6 describes an antigen capture ELISA to quantitate 
capsomere formation. Example 7 provides a hemagglutinin assay to assay 
for the induction of neutralizing antibodies. 
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Example 1 
Construction of Chimeric LI Genes 

DNA encoding the HPV 16 LI open reading frame was excised 

from plasmid 16-1 14/k-Ll/L2-pSynxtVT [Kirnbauer et al., J. Virol. 

67:6929-6936 (1994)] using BglR and the resulting fragment subcloned into 

pUC19 (New England Biolabs, Beverly, MA) previously linearized at the 

unique BamlU restriction site. Two basic expression constructs were first 

generated to permit subsequent insertion of DNA to allow fusion protein 

expression. One construct encoded HPV 16 LI a3 10 having a nine amino 

acid deletion; the deleted region was known to show low level homology 

with all other papilloma virus LI proteins. The second construct, HPV 16 

LI aC, encoded a protein having a 34 amino acid deletion of the carboxy 

terminal LI residues. Other constructs include an EcoRV restriction site at 

the position of the deletion for facilitated insertion of DNA encoding other 

protein sequences. Addition of the EcoRV site encodes two non-Ll protein 

amino acids, aspartate and isoleucine. 

A. Generation of an HPV 16 L1a310 expression construct 

Two primers (SEQ ID NOs: 5 and 6) were designed to 
amplify the pUC19 vector and the complete HPV 16 LI coding sequence, 
except nucleotides 916 through 942 in SEQ ID NO: 1. Primers were 
synthesized to also introduce a unique EcoRV restriction site (underlined in 
SEQ ID NOs: 5 and 6) at the termini of the amplification product. 

CCCC GATATC GCCTTTAATGTATA A ATCGTCTGG 
SEQ ID NO: 5 

CC C CGATATCTC A A ATTATTTTCCT AC AC CT AGTG 
SEQ ID NO: 6 

The resulting PCR product was digested with EcoRV to provide 
complementary ends and the digestion product circularized by ligation. 
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Ligated DNA was transformed into E. coli using standard techniques and 
plasmids from resulting colonies were screened for the presence of an 
EcoRV restriction site. One clone designated HPV 16 LI a310 was 
identified as having the appropriate twenty-seven nucleotide deletion and 
this construct was used to insert DNA fragments encoding other HPV 1 6 
proteins at the EcoRV site as discussed below. 

B. Generation of an HPV 16 LI aC expression constructs 

Two primers (SEQ ID NOs: 7 and 8) were designed 
complementary to the HPV 16 LI open reading frame such that the primers 
abutted each other to permit amplification in reverse directions on the 
template DNA comprising HPV 16 LI -encoding sequences in pUC19 
described above. 

AAA GATATC TTGTAGTAAAAATTTGCGTCCTAAAGGAAAC 
SEQ ID NO: 7 

AAAGAJATCTAATCTACCTCTACAACTGCTAAACGCAAAAAACG 
SEQ ID NO: 8 

Each primer introduced an EcoRV restriction site at the terminus of the 
amplification product. In the downstream primer (SEQ ID NO: 8), the 
.EcoRV site was followed by a TAA translational stop codon positioned 
such that the amplification product, upon ligation of the EcoRV ends to 
circularize, would include deletion of the 34 carboxy terminal LI amino 
acids. PCR was performed to amplify the partial LI open reading frame 
and the complete vector. The amplification product was cleaved with 
EcoRV, circularized with T4 DNA iigase, and transformed into E. coli 
DH5 a cells. Plasmids from viable clones were analyzed for the presence 
of an EcoRV site which would linearize the plasmid. One positive 



- 16 - 

construct designated pUCHPV16L1aC was identified and used to insert 
DNA from other HPV 16 proteins utilizing the EcoRV site. 

C. Insertion of DNA fragments into HPV 16 LI a310 and HPV16L1aC 

DNA fragments of HPV 16 E7 encoding amino acids 1-50, 
1-60. 1-98, 25-75, 40-98, 50-98 in SEQ ID NO: 4 were amplified using 
primers that introduced terminal 5' EcoRV restriction sites in order to 
facilitate insertion of the fragment into either HPV 16 LI a310 and 
HPV16L1aC modified sequence. In the various amplification reactions, 
primer E7.1 (SEQ ID NO: 9) was used in combination with primer E7.2 
(SEQ ID NO: 10) to generate a DNA fragment encoding E7 amino acids 1- 
50: with primer E7.3 (SEQ ID NO: 11) generate a DNA fragment 
encoding E7 amino acids 1-60: or with primer E7.4 (SEQ ID NO: 12) 
generate a DNA fragment encoding E7 amino acids 1-98. In other 
amplification reactions, primer pairs E7.5 (SEQ ID NO: 13) and E7.6 
(SEQ ID NO: 14) were used to amplify a DNA fragment encoding E7 
amino acids 25-75: E7.7 (SEQ ID NO: 15) and E7.4 (SEQ ID NO: 12) 
were used to amplify a DNA fragment encoding E7 amino acids 40-98; and 
E7.8 (SEQ ID NO: 16) and E7.4 (SEQ ID NO: 12) were used to amplify a 
DNA fragment encoding E7 amino acids 50-98. 

Primer E7. 1 SE q id NO; 9 

AAAAGj^TATCATGCATGGAGATACACCTACATTGC 



Primer E1.2 SE q id n0: 10 

TTTTGj^rATCGGCTCTGTCCGGTTCTGCTTGTCC 



Primer E7.3 SE q id NO: 1 1 

TTrTGAIATCCTTGCAACAAAAGGrrACAATATTGTAATGGGCC 
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Primer E7.4 SEQ ID NO: 12 

AAAA GATATC TGGTTTCTGAGAACAGATGGGGCAC 



Primer E7.5 SEQ ID NO: 13 

TTT TGATATC GATTATGAGCAATTAAATGACAGCTCAG 



Primer E7.6 SEQ ID NO: 14 

I'l'l'l GATATC GTCTACGTGTGTGCTTTGTACGCAC 



Primer E7.7 SEQ ID NO: 15 

TTTATC GATATC GGTCCAGCTGGACAAGCAGAACCGGAC 



Primer E7.8 SEQ ID NO: 16 

10 TTTTGj^TATCGATGCCCATTACAATATTGTAACCTTTTG 



Similarly, nucleotides from DNA encoding the influenza 
matrix protein (SEQ ID NO: 17) was amplified using the primer pair set 
out in SEQ ID NOs: 19 and 20. Both primers introduced an EcoRV 
restriction site in the amplification product. 



1 5 TTTTGATATCGATATGGAATGGCTAAAG ACAAGACCAATC 

SEQ ID NO: 19 

TTTTGATATC GTTGTTTG G ATC C C C ATTC C C ATTG 

SEQ ID NO: 20 



PCR products from each amplification reaction were cleaved 
20 with EcoRV and inserted into the EcoRV site of either the HPV 16 LI 
a310 and HPV16L1aC sequences previously linearized with the same 
enzyme. In order to determine the orientation of inserts in plasmids 
encoding E7 amino acids 25-75 and 50-98 and plasmid including influenza 
matrix protein, Clal digestion was employed, taking advantage of a 
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restriction site overlapping the newly created EcoRV restriction site 
(GATATCGAT) and included in the upstream primer. For the three 
expression constructs including the initiating methionine of HPV16 E7, 
insert orientation was determined utilizing a NslL restriction site within the 
E7 coding region. 

Once expression constructs having appropriate inserts were 
identified, the protein coding region for both LI and inserted amino acids 
was excised as a unit using restriction enzymes Xbal and SmcH and the 
isolated DNA ligated into plasmid pVL1393 (Invitrogen) to generate 
recombinant baculoviruses. 



D. Elimination of EcoRV Restriction Sites in Expression Constructs 

The HPV 16 LI aC sequence includes DNA from the 
EcoRV site that results in translation of amino acids not normally found in 
wild-type LI polypeptides. Thus, a series of expression constructions was 
designed in which the artificial EcoRv site was eliminated. The LI 
sequence for this series of expression constructs was designated HPV 
16LUC*. 

To generate an expression construct containing the HPV 
16LUC sequence, two PCR reactions were performed to amplify two 
overlapping fragments from the pUC-HPV16 LIaC encoding E7 amino 
acids 1-50. The resulting DNA fragments overlapped at the position of the 
L1/E7 boundary but did not contain the two EcoRV restriction sites. 
Fragment 1 was generated using primers PI (SEQ ID NO: 21) and P2 
(SEQ ID NO: 22) and fragment 2 using primers P3 (SEQ ID NO: 23) and 
P4 (SEQ ID NO: 24). 



Primer PI SEQ ID NO: 21 

GTTATGACATACATACATTCTATG 



SEQ ID NO: 22 
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CCATGCATTCCTGCTTGTAGTAAAAATTTGCGTCC 

Primer P3 SEQ ID NO: 23 

CTACAAGCAGGAATGCATGGAGATACACC 

Primer P4 SEQ ID NO: 24 

CATCTGAAGCTTAGTAATGGGCTCTGTCCGGTTCTG 



Following the first two amplification reactions, the two 
purified products were used as templates in another PCR reaction using 
primers PI and P4 only. The resulting amplification product was digested 
with enzymes EcoNI and HindHI inserted into the HPV 16L1aC 
expression construct described above following digestion with the same 
enzymes. The resulting expression construct differed from the original 
HPV16L1aC construct with DNA encoding LI and E7 amino acids 1-50 by 
loss of the two internal EcoKV restriction sites. The first EcoKV site was 
replaced by DNA encoding native LI alanine and glycine amino acids in 
this position and the second was replaced by a translational stop signal. In 
addition, the expression constnict. designated HPV 16 LIaC* E7 1-52, 
contained the first 52 amino acids of HPV 16 E7 as a result of using 
primer P4 which also encodes E7 amino acids residues histidine at position 
51 and tyrosine at position 52. HPV 16 LIaC* E7 1-52 was then used to 
generate additional HPV 16 LIaC expression constructs further including 
DNA encoding E7 amino acids 1-55 using primer PI (SEQ ID NO: 21) in 
combination with primer P5 (SEQ ID NO: 25), E7 amino acids 1-60 with 
primer pair PI and P6 (SEQ ID NO: 26), and E7 amino acids 1-65 with 
primer pair PI and P7 (SEQ ID NO: 27). The additional animo acid- 
encoding DNA sequences in the amplification products arose from design 
of the primers to include additional nucleotides for the desired amino acids. 
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Primer P5 SEQ ED NO: 25 

CATCTGAAGCTTAACAATATTGTAATGGGCTCTGTCCG 



Primer P6 SEQ ID NO: 26 

CATCTGAAGCTTACTTGCAACAAAAGGTTA- 

CAATATTGTAATGGGCTCTGTCCG 



Primer P7 SEQ ID NO: 27 

CATCTGAAGCTTAAAGCGTAGAGTCACACTTGCAAC- 

AAAAGGTTACAATATTGTAATGGGCTCTGTCCG 



Similarly, HPV 16 LIaC E7 1-70 was generated using template DNA 
encoding HPV 16 LIaC* E7 1-66 and the primer pair PI and P8 (SEQ ID 
NO: 28). 



Primer P8 SEQ ID NO: 28 

CATCTGAAGCTTATTGTACGCACAAC- 

C G A AGCGTAG AGTC AC ACTTG 



Following each PCR reaction, the amplification products were digested 
with EcoNl and NindUl and inserted into HPV16L1&C previously digested 
with the same enzymes. Sequences of each constructs were determined 
using an Applied Biosystems Prism 377 sequencing instrument with 
fluorescent chain terminating dideoxynucleotides [Prober et al, Science 
238: 336-341 (1987)]. 

Example 2 

Generation of Recombinant Baculoviruses 

Spodoptera frugiperda (Sf9) cells were grown in suspension or 
monolayer cultures at 27° in TNMFH medium (Sigma) supplemented with 
10% fetal calf serum and 2 mM glutamine. For HPV 16 LI -based 
recombinant baculovirus construction, Sf9 cells were transfected with 10 fig 
of transfer plasmid together with 2 fig of linearized Baculo-Gold DNA 



(PharMingen, San Diego, CA). Recombinant viruses were purified by 
according to manufacturer's suggested protocol. 

To test for expression of HPV 16 LI protein, 10 5 Sf9 cells 
were infected with baculovirus recombinant at a multiplicity of infection 
(m.o.i) of 5 to 10. After incubation for three to four days at 28°C, media 
was removed and cells were washed with PBS. The cells were lysed in 
SDS sample buffer and analyzed by SDS-PAGE and Western blotting using 
anti-HPV16 LI and anti-HPV16 E7 antibodies. 

In order to determine which of the chimeric LI protein 
expression constructs would preferentially produce capsomeres, extracts 
from transfected cells were subjected to gradient centrifugation. Fractions 
obtained from the gradient were analyzed for LI protein content by 
Western blotting and for VLP formation by electron microscopy. The 
results are shown in Table 1 . 

The intact HPV LI protein, as well as the expression 
products HPV 16 LI A3 10 and HPV 16 LI AC, each were shown to produce 
capsomeres and virus-like panicles in equal proportions. When E7 coding 
sequences were inserted into the HPV 16 LI A3 10 vector, only fusion 
proteins including E7 amino acids 1 to 50 produced gave rise to detectable 
capsomere formation. 

When E7 encoding DNA was inserted into the HPV 16 
LI AC vector, all fusion proteins were found to produce capsomeres; 
chimeric proteins including E7 amino acid residues 40-98 produced the 
highest level of exclusively capsomere structures. Chimeric proteins 
including E7 amino acids 1-98 and 25-75 both produced predominantly 
capsomeres, even thorough virus-like particle formation was also observed. 
The chimeric protein including E7 amino acids 1-60 resulted in nearly 
equal levels of capsomere and virus-like particle production. 

When E7 sequences were inserted into the HPV 16 L1A*C 
vector, all fusion proteins were shown to produce capsomeres. Insertion of 
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DNA encoding E7 residues 1-52, 1-55, and 1-60 produced the highest level 
of capsomeres, but equal levels of virus-like particle production were 
observed. While insertion of DNA encoding E7 DNA for residues 1-65, 1- 
70, 25-75, 40-98, and 1-98 resulted in comparatively lower levels or 
undetectable levels of capsid, capsomeres were produced in high quantities. 



TABLE 1 

Capsomeree and Capsid Forming Capacity of 
Chimeric HPV LI Proteins 



L 1 Expression 
Construct 



HVP 16 LI 
HPV 16 L1a310 
HPV 16 LUC 
HPV 16 LU310 
HPV 16 L1a3I0 
HPV 16 LU310 
HPV 16 LU310 
HPV 16 LUC 
HPV 16 LUC 
HPV 16 LUC 
HPV 16 LUC 
HPV 16 LUC 
HPV 16 LUC 
HPV 16 LU*C 
HPV 16 LU*C 
HPV 16 LU*C 
HPV 16 LU*C 
HPV 16LU*C 



E7 1-98 

E7 1-50 
E7 25-75 
E7 50-98 

E7 1-98 
E7 25-75 
E7 50-98 

E7 1-60 
El 40-98 
Influenza 
E7 1-52 
E7 1-55 
E7 1-60 
E7 1-65 
E7 1-70 



Capsomere 
Yield 



+ + + 
+ + + + 



+ + + 
+ + + 



+ + + + 
+ + + 
+ + + + + 
+ + + + + 
+ + + 
+ + 
+ + 



Capsid 
Yield 



+ + + + + 
+ + + + + 
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Example 3 
Purification of Capsomeres 

Trichopulsia ni (TN) High Five cells were grown to a density of 

approximately 2 x 10 6 cells/ml in Ex-Cell 405 serum-free medium (JRH 

Biosciences). Approximately 2 x 10 8 cells were pelleted by centrifugation at 1000 

x g for 15 minutes, resuspended in 20 ml of medium, and infected with 

recombinant baculoviruses at m.o.i of 2 to 5 for 1 hour at room temperature. 

After addition of 200 ml medium, cells were plated and incubated for 3 to 4 days 

at 27°C. Following incubation, cells were harvested, pelleted, and resuspended in 

10 ml of extraction buffer. 

The following steps were performed at 4°C. Cells were sonicated 

for 45 seconds at 60 watts and the resulting cell lysate was centrifuged at 10,000 

rpm in a Sorval SS34 rotor. The supernatant was removed and retained while the 

resulting pellet was resuspended in 6 ml of extraction buffer, sonicated for an 

additional 3 seconds at 60 watts, and centrifuged again. The two supernatants 

were combined, layered onto a two-step gradient containing 14 ml of 40% sucrose 

on top of 8 ml of CsCl solution (4.6 g CsCi per 8 ml in extraction buffer), and 

centrifuged in a Sorval AH629 swinging bucket rotor for 2 hours at 27,000 rpm at 

10°C. The interface region between the CsC! and the sucrose along with the CsCl 

complete layer were collected into 13.4 ml Quickseal tubes (Beckman) and 

extraction buffer added to adjust the volume 13.4 ml. Samples were centrifuged 

overnight at 50.000 rpm at 20°C in a Beckman 70 TI rotor. Gradients were 

fractionated (1 ml per fraction) by puncturing tubes on top and bottom with a 21- 

gauge needle. Fractions were collected from each tube and 2.5 fx\ of each fraction 

were analyzed by a 10% SDS-polyacrylamide gel and Western blotting using an 

anti-HPV16 LI antibody. 

Virus-like particles and capsomeres were separated from the fractions 

identified above by sedimentation on 10 to 50% sucrose gradients. Peak fractions 

from CsCl gradients were pooled and dialyzed for 2 hours against 5 mM HEPES 

(pH 7.5). Half of the dialysate was used to produce capsomeres by disassembly of 

intact VLPs overnight by adding EDTA (final concentration 50 mM), EGTA (50 
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mM), DTT (30 mM). NaCl (100 mM), and Tris/HCl, pH 8.0, (10 mM). As 
control, NaCl and Tris/HCl only were added to the other half. 

For analysis of capsomeres produced from disassembled VLPs, 
EDTA, EGTA, and DTT (final concentration 5 mM each) were added to the 
sucrose cushions which were centrifuged at 250,000 x g for 2 to 4 hours at 4°C. 
Fractions were collected by puncturing tubes from the bottom. A 1:10 dilution of 
each fraction was then analyzed by antigen capture ELISA. 

Example 4 
Immunization Protocol for Production of 
Polyclonal Antisera and Monoclonal Antibodies 

Balb/c mice are immunized subcutaneously three times, every four 

weeks with approximately 60 tig of HPV chimeric capsomeres mixed 1:1 with 

complete or incomplete Freund's Adjuvants in a total volume of 100 p\. Six 

weeks after the third immunization, mice are sacrificed and blood is collected by 

cardiac puncture. 

Example 5 

Peptide ELISA to Quantitate Capsomere Formation 

Microtiter plates (Dynatech) are coated overnight with 50 /xl of 
peptide E701 [Muller et at, 1982] at a concentration of 10 /xg/ml in PBS. Wells 
are blocked for 2 hour at 37°C with 100 /xl of buffer containing 5% BSA and 
0.05% Tween 20 in PBS and washed three times with PBS containing 0.05% 
Tween 20. After the third wash. 50 /xl of sera diluted 1:5000 in BSA/Tween 
20/PBS is added to each well and incubation carried out for 1 hour. Plates are 
washed again as before and 50 /xl of goat-anti-mouse peroxidase conjugate is added 
at a 1:5000 dilution. After 1 hour, plates are washed and stained using ABTS 
substrate (0.2 mg/ml, 2.2'-Azino-bis(3-ethylbenzhiazoIine-/S-sulfonic acid in 0.1 M 
Na-Acetate-Phosphate buffer (pH 4.2) with 4 /xl 30% H 2 0 2 per 10 ml). 
Extinction is measured after 1 hour at 490 nm in a Dynatech automated plate 
reader. 
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Example 6 

Antigen Capture ELISA to Quantitate Capsomere Formation 

To allow relative quantification of virus-like particles and 
capsomeres in fractions of CsCl gradients, an antigen capture ELISA was utilized. 
Microtiter plates were coated overnight with 50 ^1/well of a 1:500 dilution (final 
concentration of 2 ng per ml, in PBS) with a protein A purified mouse monoclonal 
antibody immunospecific for HPV 16 LI (antibodies 25/C, MM07 and Ritti 1 
were obtained from mice immunized with HPV 16 VLPs). Plates were blocked 
with 5% milk/PBS for 1 hour and 50 pi of fractions of CsCl gradients were added, 
for 1 hour at 37°C using a 1:300 dilution (in 5% milk/PBS). After three washings 
with PBS/0.05% Tween 20, 50 /d of a polyclonal rabbit antiserum (1:3000 
dilution in milk/PBS), raised against HPV 16 VLPs was added and plates were 
incubated at 37° for 1 hour. Plates were washed again and further incubated with 
50 fx\ of a goat-anti-rabbit peroxidase conjugate (Sigma) diluted 1 :5000 in PBS 
containing 5% milk for 1 hour. After final washing, plates were stained with 
ABTS substrate for 30 minutes and extinction measured at 490 nm in a Dynatech 
automated plate reader. As a negative control, the assay also included wells 
coated only with PBS. 

To test monoclonal antibodies for capsomere specificity, VLPs with 
EDTA/DTT to disassemble particles. Treated particle preparations were assayed 
in the antigen-capture ELISA and readings compared to untreated controls. For 
disassembly, 40 pi of VLPs was incubated overnight at 4°C in 500 /xl of 
disruption buffer containing 30 mM DTT. 50 mM EGTA, 60 mM EDTA, 100 
mM NaCl, and 100 mM Tris/HCI. pH 8.0. Aliquots of treated and untreated 
particles were used in the above capture ELISA in a 1:20-1:40 dilution. 

Example 7 
Hemagglutinin Inhibition Assay 

In order to determine the extent to which chimeric capsomere 

vaccines evoke production of neutralizing antibodies, a hemagglutination inhibition 

assay is carried out as briefly described below. This assay is based on previous 
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observations that virus-like particles are capable of hemagglutinizing red blood 
cells. 

Mice are immunized with any of a chimeric capsomere vaccine and 
sera is collected as described above in Example 4. As positive controls, HPV16 
LI virus like particles (VLPs) and bovine PV1 (BPV) LI VLPs are assayed in 
parallel with a chimeric capsomere preparation. To establish a positive baseline, 
the HPV16 or BPV1 VLPs are first incubated with or without sera collected from 
immunized mice after which red blood cells are added. The extent to which 
preincubation with mouse cera inhibits red blood cell hemagglutinization is an 
indication of the neutralizing capacity of the mouse sera. The experiments are 
then repeated using chimeric capsomeres in order to determine the neutralizing 
effect of the mouse sera on the vaccine. A brief protocol for the hemagglutination 
inhibition assay is described below. 

One hundred microliters of heparin (1000 usp units/ml) are added to 
1 ml fresh mouse blood. Red blood cells are washed three times with PBS 
followed by centrifugation and resuspension in a volume of 10 ml. Next, 
erythrocytes are resuspended in 0.5 ml PBS and stored at 4°C for up to three 
days. For the hemagglutinin assay. 70 \i\ of the suspension is used per well on a 
96-welI plate. 

Chimeric capsomere aliquots from CsCl gradients are dialyzed for 
one hour against 10 mM Hepes (pH 7.5) and 100 fi\ of two-fold serial dilutions in 
PBS are added to mouse erythrocytes in round-bottom 96-well microtiter plates 
which are further incubated for 3-16 hours at 4°C. For hemagglutination 
inhibition, capsomeres are incubated with dilutions of antibodies in PBS for 60 
minutes at room temperature and then added to the erythrocytes. The level of 
erythrocyte hemagglutination, and therefore the presence of neutralizing 
antibodies, is determined by standard methods. 

In preliminary results, mouse sera generated against chimeric 
capsomeres comprising HPV16L1 AC protein in association with E7 amino acid 
residues 1-98 was observed to inhibit hemagglutination by HPV16 VLPs, but not 
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by BPV VLPs. The mouse sera was therefore positive for neutralizing antibodies 
against the human VLPs and this differential neutralization was most likely the 
result of antibody specificity for epitopes against which the antibodies were raised. 

Numerous modifications and variations in the invention as set forth 
in the above illustrative examples are expected to occur to those skilled in the art. 
Consequently only such limitations as appear in the appended claims should be 
placed on the invention. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION : 

(i) APPLICANT: 

(ii) TITLE OF INVENTION: Papilloma Virus Capsomere Vaccine Formulations 
and Methods of Use 

(iii) NUMBER OF SEQUENCES: 27 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Marshall, O' Toole, Gerstein, Murray & Borun 

(B) STREET: 233 South Wacker Drive, 6300 Sears Tower 
<C) CITY: Chicago 

<D) STATE: Illinois 

(E) COUNTRY: United States of America 

(F) ZIP: 60606-6402 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC - DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Williams Jr., Joseph A. 

(B) REGISTRATION NUMBER: 3 8,659 

(C) REFERENCE /DOCKET NUMBER: 27013/34028 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 312-474-6300 

(B) TELEFAX: 312-474-0448 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1518 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..1518 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

ATG TCT CTT TGG CTG CCT AGT GAG GCC ACT GTC TAC TTG CCT CCT GTC 4 8 

Met Ser Leu Trp Leu Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val 
1 5 10 15 

CCA GTA TCT AAG GTT GTA AGC ACG GAT GAA TAT GTT GCA CGC ACA AAC 96 
Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Ala Arg Thr Asn 
20 25 30 
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ATA TAT TAT CAT GCA GGA ACA TCC AGA CTA CTT GCA GTT GGA CAT CCC 
lie Tyr Tyr His Ala Gly Thr Ser Arg Leu Leu Ala Val Gly His Pro 
35 40 45 

TAT TTT CCT ATT AAA AAA CCT AAC AAT AAC AAA ATA TTA GTT CCT AAA 
Tyr Phe Pro lie Lys Lys Pro Asn Asn Asn Lys lie Leu Val Pro Lys 
50 55 60 

GTA TCA GGA TTA CAA TAC AGG GTA TTT AGA ATA CAT TTA CCT GAC CCC 
Val Ser Gly Leu Gin Tyr Arg Val Phe Arg lie His Leu Pro Asp Pro 
65 70 75 80 

AAT AAG TTT GGT TTT CCT GAC ACC TCA TTT TAT AAT CCA GAT ACA CAG 
Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro Asp Thr Gin 
85 90 * 95 

CGG CTG GTT TGG GCC TGT GTA GGT GTT GAG GTA GGT CGT GGT CAG CCA 
Arg Leu Val Trp Ala Cys Val Gly Val Glu Val Gly Arg Gly Gin Pro 
100 105 110 

TTA GGT GTG GGC ATT AGT GGC CAT CCT TTA TTA AAT AAA TTG GAT GAC 
Leu Gly Val Gly lie Ser Gly His Pro Leu Leu Asn Lys Leu Asp Asp 
115 120 125 

ACA GAA AAT GCT AGT GCT TAT GCA GCA AAT GCA GGT GTG GAT AAT AGA 
Thr Glu Asn Ala Ser Ala Tyr Ala Ala Asn Ala Gly Val Asp Asn Arg 
13 0 13 5 14 0 

GAA TGT ATA TCT ATG GAT TAC AAA CAA ACA CAA TTG TGT TTA ATT GGT 
Glu Cys lie Ser Met Asp Tyr Lys Gin Thr Gin Leu Cys Leu lie Gly 
145 150 155 ~ 160 

TGC AAA CCA CCT ATA GGG GAA CAC TGG GGC AAA GGA TCC CCA TGT ACC 
Cys Lys Pro Pro lie Gly Glu His Trp Gly Lys Gly Ser Pro Cys Thr 
165 170 175 

AAT GTT GCA GTA AAT CCA GGT GAT TGT CCA CCA TTA GAG TTA ATA AAC 
Asn Val Ala Val Asn Pro Gly Asp Cys Pro Pro Leu Glu Leu lie Asn 
180 185 190 

ACA GTT ATT CAG GAT GGT GAT ATG GTT GAT ACT GGC TTT GGT GCT ATG 
Thr Val lie Gin Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala Met 
195 200 " 205 

GAC TTT ACT ACA TTA CAG GCT AAC AAA AGT GAA GTT CCA CTG GAT ATT 
Asp Phe Thr Thr Leu Gin Ala Asn Lys Ser Glu Val Pro Leu Asp lie 
210 215 220 

TGT ACA TCT ATT TGC AAA TAT CCA GAT TAT ATT AAA ATG GTG TCA GAA 
Cys Thr Ser He Cys Lys Tyr Pro Asp Tyr He Lys Met Val Ser Glu 
225 230 235 240 

CCA TAT GGC GAC AGC TTA TTT TTT TAT TTA CGA AGG GAA CAA ATG TTT 
Pro Tyr Gly Asp Ser Leu Phe Phe Tyr Leu Arg Arg Glu Gin Met Phe 
245 250 255 

GTT AGA CAT TTA TTT AAT AGG GCT GGT GCT GTT GGT GAA AAT GTA CCA 
Val Arg His Leu Phe Asn Arg Ala Gly Ala Val Gly Glu Asn Val Pro 
260 265 " 270 

GAC GAT TTA TAC ATT AAA GGC TCT GGG TCT ACT GCA AAT TTA GCC AGT 
Asp Asp Leu Tyr He Lys Gly Ser Gly Ser Thr Ala Asn Leu Ala Ser 
275 280 285 



- 30- 



TCA AAT TAT TTT CCT ACA CCT AGT GGT TCT ATG GTT ACC TCT GAT GCC 
Ser Asn Tyr Phe Pro Thr Pro Ser Gly Ser Met Val Thr Ser Asp Ala 



CAA ATA TTC AAT AAA CCT TAT TGG TTA CAA CGA GCA CAG GGC CAC AAT 
Gin lie Phe Asn Lys Pro Tyr Trp Leu Gin Arg Ala Gin Gly His Asn 
305 310 315 320 

AAT GGC ATT TGT TGG GGT AAC CAA CTA TTT GTT ACT GTT GTT GAT ACT 
Asn Gly lie Cys Trp Gly Asn Gin Leu Phe Val Thr Val Val Asp Thr 
325 330 335 

ACA CGC AGT ACA AAT ATG TCA TTA TGT GCT GCC ATA TCT ACT TCA GAA 
Thr Arg Ser Thr Asn Met Ser Leu Cys Ala Ala He Ser Thr Ser Glu 
340 345 350 

ACT ACA TAT AAA AAT ACT AAC TTT AAG GAG TAC CTA CGA CAT GGG GAG 
Thr Thr Tyr Lys Asn Thr Asn Phe Lys Glu Tyr Leu Arg His Gly Glu 
355 360 * 365 

GAA TAT GAT TTA CAG TTT ATT TTT CAA CTG TGC AAA ATA ACC TTA ACT 
Glu Tyr Asp Leu Gin Phe He Phe Gin Leu Cys Lys He Thr Leu Thr 
370 375 380 

GCA GAC GTT ATG ACA TAC ATA CAT TCT ATG AAT TCC ACT ATT TTG GAG 
Ala Asp Val Met Thr Tyr He His Ser Met Asn Ser Thr He Leu Glu 
385 390 395 400 

GAC TGG AAT TTT GGT CTA CAA CCT CCC CCA GGA GGC ACA CTA GAA GAT 
Asp Trp Asn Phe Gly Leu Gin Pro Pro Pro Gly Gly Thr Leu Glu Asp 
405 410 415 

ACT TAT AGG TTT GTA ACC TCC CAG GCA ATT GCT TGT CAA AAA CAT ACA 
Thr Tyr Arg Phe Val Thr Ser Gin Ala He Ala Cys Gin Lys His Thr 
420 425 " 430 

CCT CCA GCA CCT AAA GAA GAT CCC CTT AAA AAA TAC ACT TTT TGG GAA 
Pro Pro Ala Pro Lys Glu Asp Pro Leu Lys Lys Tyr Thr Phe Trp Glu 
435 440 445 

GTA AAT TTA AAG GAA AAG TTT TCT GCA GAC CTA GAT CAG TTT CCT TTA 
Val Asn Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gin Phe Pro Leu 
450 455 460 

GGA CGC AAA TTT TTA CTA CAA GCA GGA TTG AAG GCC AAA CCA AAA TTT 
Gly Arg Lys Phe Leu Leu Gin Ala Gly Leu Lys Ala Lys Pro Lys Phe 
465 470 475 480 

ACA TTA GGA AAA CGA AAA GCT ACA CCC ACC ACC TCA TCT ACC TCT ACA 
Thr Leu Gly Lys Arg Lys Ala Thr Pro Thr Thr Ser Ser Thr Ser Thr 
485 490 495 

ACT GCT AAA CGC AAA AAA CGT AAG CTG TAA 
Thr Ala Lys Arg Lys Lys Arg Lys Leu * 
500 505 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 506 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



- 31 - 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Ser Leu Trp Leu Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val 
1 5 10 15 

Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Ala Arg Thr Asn 
20 25 30 

lie Tyr Tyr His Ala Gly Thr Ser Arg Leu Leu Ala Val Gly His Pro 
35 40 45 

Tyr Phe Pro lie Lys Lys Pro Asn Asn Asn Lys lie Leu Val Pro Lys 
50 55 60 

Val Ser Gly Leu Gin Tyr Arg Val Phe Arg lie His Leu Pro Asp Pro 
65 70 75 80 

Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro Asp Thr Gin 
85 90 95 

Arg Leu Val Trp Ala Cys Val Gly Val Glu Val Gly Arg Gly Gin Pro 
100 105 no 

Leu Gly Val Gly lie Ser Gly His Pro Leu Leu Asn Lys Leu Asp Asp 
115 120 125 

Thr Glu Asn Ala Ser Ala Tyr Ala Ala Asn Ala Gly Val Asp Asn Arg 
130 135 140 

Glu Cys lie Ser Met Asp Tyr Lys Gin Thr Gin Leu Cys Leu lie Gly 
145 150 155 ' 160 

Cys Lys Pro Pro He Gly Glu His Trp Gly Lys Gly Ser Pro Cys Thr 
165 170 " 175 

Asn Val Ala Val Asn Pro Gly Asp Cys Pro Pro Leu Glu Leu He Asn 
180 185 190 

Thr Val He Gin Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala Met 
195 200 205 

Asp Phe Thr Thr Leu Gin Ala Asn Lys Ser Glu Val Pro Leu Asp He 
210 215 220 

Cys Thr Ser He Cys Lys Tyr Pro Asp Tyr He Lys Met Val Ser Glu 
225 230 235 240 

Pro Tyr Gly Asp Ser Leu Phe Phe Tyr Leu Arg Arg Glu Gin Met Phe 
245 250 255 

Val Arg His Leu Phe Asn Arg Ala Gly Ala Val Gly Glu Asn Val Pro 
260 265 " 270 

Asp Asp Leu Tyr He Lys Gly Ser Gly Ser Thr Ala Asn Leu Ala Ser 
275 280 285 

Ser Asn Tyr Phe Pro Thr Pro Ser Gly Ser Met Val Thr Ser Asp Ala 
290 295 300 
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Gin lie Phe Asn Lys Pro Tyr Trp Leu Gin Arg Ala Gin Gly His Asn 
305 310 315 320 

Asn Gly He Cys Trp Gly Asn Gin Leu Phe Val Thr Val Val Asp Thr 
325 330 335 

Thr Arg Ser Thr Asn Met Ser Leu Cys Ala Ala He Ser Thr Ser Glu 
340 345 350 

Thr Thr Tyr Lys Asn Thr Asn Phe Lys Glu Tyr Leu Arg His Gly Glu 
355 360 365 

Glu Tyr Asp Leu Gin Phe He Phe Gin Leu Cys Lys He Thr Leu Thr 
370 375 380 

Ala Asp Val Met Thr Tyr He His Ser Met Asn Ser Thr He Leu Glu 
385 390 395 400 

Asp Trp Asn Phe Gly Leu Gin Pro Pro Pro Gly Gly Thr Leu Glu Asp 
405 410 " 415 

Thr Tyr Arg Phe Val Thr Ser Gin Ala He Ala Cys Gin Lys His Thr 
420 425 430 

Pro Pro Ala Pro Lys Glu Asp Pro Leu Lys Lys Tyr Thr Phe Trp Glu 
435 440 " 445 

Val Asn Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gin Phe Pro Leu 
450 455 460 

Gly Arg Lys Phe Leu Leu Gin Ala Gly Leu Lys Ala Lys Pro Lys Phe 
465 470 475 480 

Thr Leu Gly Lys Arg Lys Ala Thr Pro Thr Thr Ser Ser Thr Ser Thr 
485 490 495 

Thr Ala Lys Arg Lys Lys Arg Lys Leu * 
500 505 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1 . .29 7 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

ATG CAT GGA GAT ACA CCT ACA TTG CAT GAA TAT ATG TTA GAT TTG CAA 
Met His Gly Asp Thr Pro Thr Leu His Glu Tyr Met Leu Asp Leu Gin 
15 10 15 

CCA GAG ACA ACT GAT CTC TAC TGT TAT GAG CAA TTA AAT GAC AGC TCA 
Pro Glu Thr Thr Asp Leu Tyr Cys Tyr Glu Gin Leu Asn Asp Ser Ser 
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GAG GAG GAG GAT GAA ATA GAT GGT CCA GCT GGA CAA GCA GAA CCG GAC 14 4 

Glu Glu Glu Asp Glu lie Asp Gly Pro Ala Gly Gin Ala Glu Pro Asp 
35 ' 40 45 

AGA GCC CAT TAC AAT ATT GTA ACC TTT TGT TGC AAG TGT GAC TCT ACG 192 
Arg Ala His Tyr Asn lie Val Thr Phe Cys Cys Lys Cys Asp Ser Thr 
50 55 60 

CTT CGG TTG TGC GTA CAA AGC ACA CAC GTA GAC ATT CGT ACT TTG GAA 24 0 

Leu Arg Leu Cys Val Gin Ser Thr His Val Asp lie Arg Thr Leu Glu 
65 ' 70 75 80 



GAC CTG TTA ATG GGC ACA CTA GGA ATT GTG TGC CCC ATC TGT TCT CAG 288 
Asp Leu Leu Met Gly Thr Leu Gly lie Val Cys Pro lie Cys Ser Gin 
85 90 95 



AAA CCA TAA 297 
Lys Pro * 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met His Gly Asp Thr Pro Thr Leu His Glu Tyr Met Leu Asp Leu Gin 
15 10 15 



Pro Glu Thr Thr Asp Leu Tyr Cys Tyr Glu Gin Leu Asn Asp Ser Ser 
20 25 30 

Glu Glu Glu Asp Glu lie Asp Gly Pro Ala Gly Gin Ala Glu Pro Asp 
35 40 45 



Arg Ala His Tyr Asn lie Val Thr Phe Cys Cys Lys Cys Asp Ser Thr 
50 55 60 

Leu Arg Leu Cys Val Gin Ser Thr His Val Asp lie Arg Thr Leu Glu 
65 70 75 80 

Asp Leu Leu Met Gly Thr Leu Gly lie Val Cys Pro lie Cys Ser Gin 
85 90 95 

Lys Pro * 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
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CCCCGATATC GCCTTTAATG TATAAATCGT CTGG 34 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CCCCGATATC TCAAATTATT TTCCTACACC TAGTG 35 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
AAAGATATCT TGTAGTAAAA ATTTGCGTCC TAAAGGAAAC 4 0 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
AAAGATATCT AATCTAC CTC TACAACTGCT AAACGCAAAA AACG 44 
(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

AAAAGATATC ATGCATGGAG ATACAC CTAC ATTGC 35 " 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 34 base pairs 
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(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TTTTGATATC GGCTCTGTCC GGTTCTGCTT GTCC 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TTTTGATATC CTTGCAACAA AAGGTTACAA TATTGTAATG GGCC 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AAAAGATATC TGGTTTCTGA GAACAGATGG GGCAC 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base paxrs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TTTTGATATC GATTATGAGC AATTAAATGA CAGCTCAG 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
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TTTTGATATC GTCTACGTGT GTGCTTTGTA CGCAC 3 5 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TTTATCGATA TCGGTCCAGC TGGACAAGCA GAACCGGAC 39 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TTTTGATATC GATGCCCATT ACAATATTGT AACCTTTTG 39 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 294 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1 . .294 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 17: 

ATG AGT CTT CTA ACC GAG GTC GAA ACG CTT ACC AGA AAC GGA TGG GAG 4 8 

Met Ser Leu Leu Thr Glu Val Glu Thr Leu Thr Arg Asn Gly Trp Glu 
15 10 15 

TGC AAA TGC AGC GAT TCA AGT GAT CCT CTC ATT ATC GCA GCG AGT ATC 96 • 

Cys Lys Cys Ser Asp Ser Ser Asp Pro Leu lie lie Ala Ala Ser lie 
20 25 30 

ATT GGG ATC TTG CAC TTG ATA TTG TGG ATT TTT TAT CGT CTT TTC TTC 144 
lie Gly lie Leu His Leu lie Leu Trp lie Phe Tyr Arg Leu Phe Phe 
35 40 45 



AAA TGC ATT TAT CGT CGC CTT AAA TAC GGT TTG AAA AGA GGG CCT TCT 192 
Lys Cys lie Tyr Arg Arg Leu Lys Tyr Gly Leu Lys Arg Gly Pro Ser 
50 55 60 
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ACG GAA GGA GCG CCT GAG TCT ATG AGG GAA GAA TAT CGG CAG GAA CAG 
Thr Glu Gly Ala Pro Glu Ser Met Arg Glu Glu Tyr Arg Gin Glu Gin 
65 70 75 " 80 

CAG AGT GCT GTG GAT GTT GAC GAT GTT CAT TTT GTC AAC ATA GAG CTG 
Gin Ser Ala Val Asp Val Asp Asp Val His Phe Val Asn lie Glu Leu 
85 90 95 

GAG TAA 
Glu * 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Ser Leu Leu Thr Glu Val Glu Thr Leu Thr Arg Asn Gly Trp Glu 
1 5 10 15 

Cys Lys Cys Ser Asp Ser Ser Asp Pro Leu He He Ala Ala Ser He 



He Gly He Leu His Leu lie Leu Trp He Phe Tyr Arg Leu Phe Phe 
35 40 45 

Lys Cys He Tyr Arg Arg Leu Lys Tyr Gly Leu Lys Arg Gly Pro Ser 
50 55 60 

Thr Glu Gly Ala Pro Glu Ser Met Arg Glu Glu Tyr Arg Gin Glu Gin 
65 70 75 " 80 

Gin Ser Ala Val Asp Val Asp Asp Val His Phe Val Asn He Glu Leu 
85 90 95 

Glu * 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
TTTTGATATC GATATGGAAT GGCTAAAGAC AAGAC CAATC 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 
<ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
TTTTGATATC GTTGTTTGGA TCCCCATTCC CATTG 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 
GTTATGACAT ACATACATTC TATG 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
CCATGCATTC CTGCTTGTAG TAAAAATTTG CGTCC 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 
CTACAAGCAG GAATGCATGG AGATACACC 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 
CATCTGAAGC TTAGTAATGG GCTCTGTCCG GTTCTG 
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(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
CATCTGAAGC TTATCAATAT TGTAATGGGC TCTGTCCG 
(2) INFORMATION FOR SEQ ID NO.-2S: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CATCTGAAGC TTACTTGCAA CAAAAGGTTA CAATATTGTA ATGGGCTCTG TCCG 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
CATCTGAAGC TTAAAGCGTA GAGTCACACT TGCAACAAAA GGTTACAATA TTGTAATGGG 
CTCTGTCCG 

(2) INFORMATION FOR SEQ ID NO : 2 8 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
CATCTGAAGC TTATTGTACG CACAACCGAA GCGTAGAGTC ACACTTG 



